Corpus: san_wikipedia_2014_10K

Other corpora

2.2.5 Most frequent word beginnings

The most frequent word beginnings as character N-grams for N=1...5 with Zipf's diagram


Zipf's diagram for word beginnings


Gnuplot diagram

Top Characters
word rank frequency n-gram
1 4210 स-
2 4068 प-
3 2768 व-
4 2624 क-
5 2505 अ-
Top Character Bigrams
word rank frequency n-gram
1 1858 प्-
2 1242 वि-
3 987 स्-
4 652 नि-
5 604 सम-
Top Character Trigrams
word rank frequency n-gram
1 1834 प्र-
2 506 स्व-
3 305 निर-
4 286 श्र-
5 223 अन्-
Top Character 4-Grams
word rank frequency n-gram
1 391 प्रा-
2 278 प्रत-
3 217 निर्-
4 196 सर्व-
5 154 श्री-
Top Character 5-Grams
word rank frequency n-gram
1 165 प्रति-
2 99 भारती-
3 99 विद्य-
4 98 पूर्व-
5 88 विश्व-
311 msec needed at 2021-04-29 00:05